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Abstract. Design of experiments is a branch of statistics that aims to 
identify efficient procedures for planning experiments in order to opti- 
mize knowledge discovery. Network inference is a subfield of systems 
biology devoted to the identification of biochemical networks from ex- 
perimental data. Common to both areas of research is their focus on 
the maximization of information gathered from experimentation. The 
goal of this paper is to establish a connection between these two areas 
coming from the common use of polynomial models and techniques from 
computational algebra. 



1. Introduction 

Originally introduced in 0], the field of algebraic statistics focuses on the 
application of techniques from computational algebra and algebraic geome- 
try to problems in statistics. One initial focus of the field was the design of 
experiments, beginning with An early exposition of a basic math- 

ematical relationship between problems in the design of experiments and 



computational commutative algebra appeared in The basic strategy of 
[m and other works is to construct an algebraic model, in the form of a 
polynomial function with rational coefficients, of a fractional factorial de- 
sign. The variables of the polynomial function correspond to the factors of 
the design. One can then use algorithmic techniques from computational 
commutative algebra to answer a variety of questions, for instance about 
the classification of all polynomial models that are identified by a fractional 
design. 

If pi, . . . , Pj. are the points of a fractional design with n levels, then the 
key algebraic object to be considered is the ideal of points I that contains all 
polynomials with rational coefficients that vanish on all pj. (See the appen- 
dix for a review of basic concepts from commutative algebra.) The form of 
the polynomials in different generating sets of this ideal is of special interest. 
In particular, we are interested in so-called interpolator polynomials which 
have a unique representation, given an explicit choice of generating set. An 
interpolator polynomial /(xi, . . . , rE„) has the property that if 61, . . . , 6^. is a 
response to the design given by the Pi, then f{pi) = hi. 
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Strikingly similar constructions have been used recently to solve an en- 
tirely different set of problems related to the inference of intracellular bio- 
chemical networks, such as gene regulatory networks, from experimental 
observations. Relatively recent technological breakthroughs in molecular bi- 
ology have made possible the simultaneous measurement of many different 
biochemical species in cell extracts. For instance, using DNA microarrays 
one can measure the concentration of niRNA molecules, which provide in- 
formation about the activity levels of the corresponding genes at the time 
the cell extract was prepared. Such network-level measurements provide the 
opportunity to construct large-scale models of molecular systems, including 
gene regulatory networks. 

Here, an experimental observation consists of the measurement of n dif- 
ferent quantities at r successive time points, resulting in a time course of 
n-dimensional real- valued vectors pi , . . . , . The number r of experimental 
observations is typically very small compared to the number n of quantities 
measured, due in part to the considerable expense of making measurements. 
In recent years there has been tremendous research activity devoted to the 
development of mathematical and statistical tools to infer the entire network 
structure from such a limited set of experimental measurements. 

Inferring networks from data is a central problem in computational sys- 
tems biology, and several approaches have been developed using a variety of 
approaches. Models range from statistical models such as Bayesian networks 
to dynamic models such as Markov chains and systems of differential equa- 
tions. Another modeling framework is that of finite dynamical systems such 
as Boolean networks. A method proposed in 0] uses such data to construct 
a multi-state discrete dynamical system 

/=(/!,...,/„): A;" ^ A;" 

over a finite field k such that the coordinate functions fi are polynomials 
in variables xi, . . . ,Xn corresponding to the n biochemical compounds mea- 
sured. The system / has to fit the given time course data set, that is, 
f{pi) = Pi+i for i = 1, . . . , r — 1. The goal is to infer a "best" or most likely 
model / from a given data set which specifies a fraction of the possible state 
transitions of /. An advantage to working in a finite field is that all functions 
k"' ^ k are represented by polynomials. An important, and unanswered, 
question is to design biological experiments in an optimal way in order to 
infer a likely model with high probability. One complicating factor is that 
biochemical networks tend to be highly nonlinear. 

In this paper, we describe the two approaches and point out the similari- 
ties between the two classes of problems, the techniques used to solve them, 
and the types of questions asked. 

2. Design of experiments 

In this section we provide a description of the computational algebra 
approach to experimental design given in 0;llH- Let P be the full factorial 
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design with n factors. We make the additional simphfying assumptions that 
each factor has the same number p of levels, resulting in p" points for D. A 
model for the design is a function 

that is, / maps each point of I? to a measurement. Instead of using the 
field Q for measurements, one may choose other fields such as C or a finite 
field. From here on we will denote the field by k. It is well-known that any 
function from a finite number of points in to k can be represented by a 
polynomial, so we may assume that / is a polynomial in variables xi, . . . ,Xn 
with coefficients in k. 

Definition 2.1. A subset = {pi, . . . , p^} C P is called a fraction of V. 

We list three important problems in the design of experiments: 

(1) Identify a model for the full design D from a suitably chosen frac- 
tion J^. 

(2) Given information about features of the model, such as a list of the 
monomials (power products) appearing in it, design a fraction 
which identifies a model for V with these features. 

(3) Given a fraction !F, which models can be identified by it? 

These problems can be formulated in the language of computational al- 
gebra making them amenable to solution by techniques from this field. The 
fraction is encoded by an algebraic object I {J-), an ideal in the polyno- 
mial ring k[xi, . . . ,Xn]- This ideal contains all those polynomial functions 
g G k[xi, . . . , Xn] such that g{pi) = for all i = 1, . . . , r. It is called the ideal 
of points of the Pi and contains all polynomials confounded by the points 
in J^. Here we assume that the points are distinct. We will see that one 
can draw conclusions about from its ideal of confounding polynomials. In 
particular, since any two polynomial models on that differ by a confound- 
ing polynomial are identical on JF, it is advantageous to choose models from 
the quotient ring R = k[xi, . . . ,x„]//(jF) rather than from the polynomial 
ring itself. 

It can be shown that the ring R is isomorphic to the vector space k'^, and 
we need to study possible vector space bases for R consisting of monomials. 
This can be done using Grobner bases of the ideal I (J-) (see the appendix). 
For each choice of a term order for k[xi, . . . , Xn], that is, a special type 
of total ordering of all monomials, we obtain a canonical generating set 
G = {gi, . . . , gs} for I (J-)- We obtain a canonical /c-basis for the vector 
space R = k^ hy choosing all monomials which are not divisible by the 
leading monomial of any of the gi. We can then view each polynomial in R 
as a fc-linear combination of the monomials in the basis. 

To be precise, let {Ti, . . . , Tt} be the set of all monomials in the variables 
xi, . . . ,Xn which are not divisible by the leading monomial of any g^. Then 
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each element f £ R can be expressed uniquely as a fc-linear combination 

t 

i=i 

with Qj € k. Suppose now that we are given a fractional design JF = 
{pi, . . . ,Pr} and an experimental treatment resulting in values f{pi) = hi 
for z = 1, . . . , r. If we now evaluate the generic polynomial / at the points Pi, 
we obtain a system of linear equations 

aiTi(pi) + . . . + atrt(pi) = 61, 

aiTi{i>r) + ■ ■ ■ + atTtiVr) = 

We can view these equations as a system of linear equations in the vari- 
ables aj with the coefficients Tj(pi). We now obtain the main criterion for 
the unique identifiability of a model / from the fraction J^. 

Theorem 2.2. 0, Thru. 4.12] Let X = {pi,...,pr} be a set of dis- 
tinct points in A;", and let f be a linear model with monomial support 
S = {Ti,... ,Tt}, that is, f = Y^-aiTi. Let X{S,X) be the (r x t)-matrix 
whose {i,j)-entry is Tj(pj). Then the model f is uniquely identifiable by X 
if and only if X{S, X) has full rank. 

In this section we have given a brief outline of a mathematical framework 
within which one can use tools from computational algebra to address the 
three experimental design problems listed above. In the next section we will 
describe a similar set of problems and a similar approach to their solution 
in the context of biochemical network modeling. 

3. Biochemical network inference 

Molecular biology has seen tremendous advances in recent years due 
to technological breakthroughs that allow the generation of unprecedented 
amounts and types of data. For instance, it is now possible to simultaneously 
measure the activity level of all genes in a cell extract using DNA microar- 
rays. This capability makes it possible to construct large-scale mathematical 
models of gene regulatory and other types of cellular networks, and the con- 
struction of such models is one of the central foci of computational systems 
biology. The availability of obtaining experimental measurements for large 
numbers of entities that are presumed to be interconnected in a network 
drives the need for the development of network inference algorithms. We 
will focus on the mathematical aspects of this problem for the rest of the 
section. More biological background can be found in 0]. 

We consider a dynamic network with n variables xi, . . . , Xn- These could 
represent products of n genes in a cell extract from a particular organism, 
say yeast. It is known that cellular metabolism and other functions are 
regulated by the interaction of genes that activate or suppress other genes 
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and form a complex network. Suppose we are given a collection of pairs of 
simultaneous measurements of these variables: 

(pi,qi), . . . , (Pr,qr), 

with pi,qi points in R". For gene networks, each of these measurements 
could be obtained from a DNA microarray. Each pair (pi,qi) is to be in- 
terpreted as follows. The variables in the network are initialized at pj and 
subsequently the network transitions to q^. This might be done through 
a perturbation such as an experimental treatment, and pj represents the 
network state immediately after the perturbation and q^ represents the net- 
work state after the network has responded to the perturbation. Sometimes 
the measurement pairs are consecutive points in a measured time course. In 
this case the pairs above consist of consecutive time points. Typically the 
number n of variables is orders of magnitude larger than the number r of 
measurements, in contrast to engineering applications where the reverse is 
true (OR where r is on the order of n). For instance the network may con- 
tain hundreds or thousands of genes, from which only 10 or 20 experimental 
measurements are collected. 

Example 3.1. Consider the following time course for a biochemical network 
of 3 genes, labeled xi,X2, and x^. 



Xi 


X2 


X3 


1.91 


3.30 


1.98 


1.50 


1.42 


1.99 


1.42 


1.31 


0.03 


0.83 


1.96 


1.01 


0.97 


2.08 


1.01 



Each gene's expression levels were measured at 5 consecutive time points and 
each entry represents a measurement. While the data are given in tabular 
form, we could have also represented the data as the pairs of network states 

((1.91, 3.30, 1.98), (1.50, 1.42, 1.99)) 
((1.50, 1.42, 1.99), (1.42, 1.31, 0.03)) 
((1.42, 1.31, 0.03), (0.83, 1.96, 1.01)) 
((0.83, 1.96, 1.01), (0.97, 2.08, 1.01)) . 

Network inference problem. Given input-output measurements {(pi, qi)}, 
infer a model of the network that produced the data. 

One can consider a variety of different model types. First it is of interest to 
infer the directed graph of causal connections in the network, possibly with 
signed edges indicating qualitative features of the interactions. Dynamic 
model types include systems of differential equations, Boolean networks, 
Bayesian networks, or statistical models, to name a few. In light of the fact 
that DNA microarray data contain significant amounts of noise and many 
necessary parameters for models are unknown at this time, it suggests itself 
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to consider a finite number of possible states of the variables Xi rather than 
treating them as real-valued. This is done by Bayesian network inference 
methods, for instance. The issue of data discretization is a very subtle 
one. On the one hand, discrete data conform more to actual data usage by 
experimentalists who tend to interpret, e.g., DNA microarray data in terms 
of genes fold changes of regulation compared to control. On the other hand, 
a lot of information is lost in the process of discretizing data and the end 
result typically depends strongly on the method used. In the extreme case, 
one obtains only two states corresponding to a binary ON/OFF view of gene 
regulation. In our case, a strong advantage of using discrete data is that it 
allows us to compute algorithmically the whole space of admissible models 
for a given data set, as described below. Nonetheless, the result typically 
depends on the discretization method and much work remains to be done 
in understanding the effect of different discretization methods. Once the 
variables take on values in a finite set k of states, it is natural to consider 
discrete dynamical systems 

As mentioned, the dynamics is generated by repeated iteration of the map- 
ping F. In order to have mathematical tools available for model construction 
and analysis, one can make the assumption that k is actually a finite field 
rather than simply a set. In practice this is easily accomplished, since the 
only ingredient required is the choice of a finite state set that has cardinality 
a power of a prime number. With these additional assumptions our models 
are polynomial dynamical systems 

F={h,...,U):k^ ^k\ 

with fi E k[xi, . . . , Xn] for i = 1, . . . ,n. (As remarked above, any function 
from a finite set of points into a field can be represented as a polynomial 
function.) The i-th. polynomial function fi describes the transition rule for 
gene X£ and hence fi is called the transition function for X£. 

Returning to the network inference problem, we can now rephrase it in 
the following form: Given the state transitions {(pj,qj)}, find a polynomial 
dynamical system (or polynomial model) F such that F{pi) = q^. 

This problem can be solved one node at a time, that is, one transition 
function at a time. This "local" approach to inference then begins with a 
collection {pi} of points, and we are looking for transition functions fi G 
k[xi, . . . , Xn] that satisfy the condition that fe{pi) = bi, where bi is the i-th 
entry in q^. 

Example 3.2. Let 

(pi,qi) = ((2,2,2), (1,0,2)), 

(P2,q2) = ((1,0, 2), (1,0,0)), 

(P3,q3) = ((1,0,0), (0,1,1)), 

(P4,q4) = ((0,1,1), (0,1,1)). 
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be the discretization of the data in Example 13.11 into the 3-element field 
A; = F3 by discretizing each coordinate separately, according to the method 
described in Q]- Then the goal is to find a polynomial model F : — > 
such that F{pi) = for i = 1, . . . , 4. Since any such F can be written as 
F = (/i,/2,/3), we can instead consider the problem of finding transition 
functions fi : — > k such that fe{pi) = (la, for all 1 < ^ < 3 and 
1 < i < 4. 

The similarity to the problem about experimental design in the previ- 
ous section is now obvious. Factors correspond to variables Xi representing 
genes; levels correspond to the elements of the field k representing gene 
states; the points Pi of the factorial design correspond to experimental mea- 
surements; and the bi in both cases are the same. As mentioned earlier, 
the available experimental observations are typically much fewer than the 
totality of possible system states. Thus, the objective in both cases is the 
same: Find good polynomial models for the full design from an experimental 
treatment of a fractional design. 

The approach to a solution is quite similar as well. Suppose we are given 
two transition functions / and g that both agree on the given experimental 
data, that is, f{pi) = hi = g{pi) for all i. Then (/ — g){pi) = 0, so that 
any two transition functions differ by a polynomial function that vanishes 
on all given observations, that is, by a polynomial in the ideal of points 
I(pi, . . . , pr), which we called in the previous section. If / is a partic- 
ular transition function that fits the data for some xi, then the space of all 
feasible models for xt is 

/-hI(pi,...,Pr)- 

The problem then is to choose a model from this space. In design of 
experiments, the single- variable monomials represent the main effects and 
the other monomials represent interactions. In the biochemical network case 
the situation is similar. Single- variable monomials in a model for a gene 
regulatory network represent the regulation of one gene by another, whereas 
the other monomials represent the synergistic regulation of one gene by a 
collection of other genes, for example through the formation of a protein 
complex. In general, very little theoretical information is available about 
the absence or presence of any given monomial in the model. One possible 
choice is to pick the normal form of / with respect to a particular Grobner 
basis for the ideal /(pi, . . . , Pr). However, this normal form depends on the 
particular choice of Grobner basis. Other approaches are explored in [? ], in 
particular an "averaging" process over several different choices of Grobner 
basis. 
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Example 3.3. Returning to our running example, consider the following poly- 
nomials: 

flixi,X2,X3) = 2X2X3 + 2X2 + 2X3, 

/2(X1,X2,X3) = 2X3 + X2 + X2 + 2X3 + 1, 

/3(xi,X2,X3) = 2x| + 2xi+2. 

Each fi interpolates the discretized data for X£ (see Example 13. 2p . The ideal 
of the input points pi , . . . , P4 is 

/ = (xi + X2 + 2, X2X3 + 2x| + 2x1 + X2,xl + 2x| + X2 + 2x3). 

Then the model space for each X£ is given by fi + /. The Grobner basis G 
for / with respect to the graded reverse lexicographical term order >~ with 
xi >- X2 >- X3 is 

G = {xi + X2 + 2, X2X3 + 2x| + X2 + 2x3, X2 + 2x3 + X2 + 2x3, X3 + 2x3}. 

To choose a model for each xg, we compute the normal form of fe with 
respect to y, resulting in the polynomial dynamical system F = {fi, f2, fs) '■ 
(F3)3 (F3)3 with 

/l(xi,X2,X3) = 2x| + X3 
/2(xi,X2,X3) = X3 + 2x3 + 1 
/3(xi,X2,X3) = 2x| + X2 + l. 

Given a polynomial model F = (/i, . . . , /„) for a network, one can pre- 
dict the connectivity structure of the nodes by analyzing the relationship 
between the variables and the transition functions. For example, the transi- 
tion function for xi given above is in terms of X3, but not the other variables. 
The interpretation is that regulation of the gene represented by xi is depen- 
dent only on X3. The dynamic behavior of the network can be simulated by 
evaluating F on all possible network states, that is, on all of /c". 

Definition 3.4. Let F = (/i, . . . , /„) : A;" — > /c" be a polynomial dynam- 
ical system. The wiring diagram of F is the directed graph {V, E) with 
V = {xi, . . . , x„} and E = {(xj, Xj) : Xj is a variable of /,}. The state space 
of F is the directed graph (V, E) with V = k"^ and E = {(a, -F(a) : a G A:"}. 

Viewing the structure and dynamics of a network via the wiring diagram 
and state space, respectively, allows one to uncover features of the network, 
including feedback loops and limit cycles, respectively (for example, see [3]). 

Example 3.5. The polynomial model F in Example 13.31 gives rise to the 
inferred wiring diagram and state space of the 3-gene network, as displayed 
in Figure [TJ The network is predicted to have a feedback loop between X2 and 
X3, and the expression of X3 is controlled via autoregulation. Furthermore, 
the network has two possible limit cycles: the fixed point at (0,1,1) and the 
3-cycle on (0,1,0), (0,1,2), and (1,0,1). The fixed point is considered to be an 
equilibrium state of the network, and the 3-cycle represents an oscillation. 
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Figure 1. Wiring diagram (top) and state space (bottom) 
for the polynomial model F in Example 13.31 

While the above polynomial dynamical system may be a reasonable model 
for the 3-gene network, it is not unique. We recall from Theorem 12.21 that 
the number of monomials in the basis for k[xi,X2,X3]/I{pi, . . . ,p4) is the 
number of data points (4, in this case). Since any transition function can be 
written as a /c-linear combination of the basis monomials, then for a fixed 
term order there are \k\"^ = 3^ possible transition functions where m is the 
number of data points. In fact there are (|A;|"*)" = 3^^ possible polynomial 
models, given a term order. As there are 5 term orders which produce 
distinct polynomial models0, there are ((|A;|™)"')^ = 3^*^ possible models for 
a 3-variable system on 3 states and 4 data points. 

An important problem in this context that is common to both design of 
experiments and biochemical network inference is the construction of good 
fractional designs that narrow down the model space as much as possible. 
The challenge in network inference is that experimental observations tend 
to be very costly, severely limiting the number of points one can collect. 
Furthermore, many points are impossible to generate biologically or experi- 
mentally, which provides an additional constraint on the choice of fractional 
design. 

4. Polynomial dynamical systems 

It is worth mentioning that polynomial dynamical systems over finite 
fields (not to be confused with dynamical systems given by differential equa- 
tions in polynomial form) have been studied in several different contexts. For 
instance, they have been used to provide state space models for systems for 

^We computed the marked Grobner bases of the ideal /(pi, . . . , P4) via the Grobner 
fan and then computed the normal forms of the interpolating polynomials in Example 13.31 
with respect to each of these Grobner bases to obtain the 5 distinct polynomial models. 
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the purpose of developing controllers [5|; la] in a variety of contexts, includ- 
ing biological systems Q]- Another use for polynomial dynamical systems is 
as a theoretical framework for agent-based computer simulations [9|]. Note 
that this class of models includes cellular automata and Boolean networks 
(choosing the field with two elements as state set), so that general poly- 
nomial systems are a natural generalization. In this context, an important 
additional feature is the update order of the variables involved. 

The dynamical systems in this paper have been updated in parallel, in 
the following sense. If / = (/i, . . . , /„) is a polynomial dynamical system 
and a G /c" is a state, then /(a) = (/i(a), . . . , /„(a)). By abuse of notation, 
we can consider each of the fi as a function on A;" which only changes the 
ith coordinate. If we now specify a total order of 1, . . . , n, represented as a 
permutation a (z Sn, then we can form the dynamical system 

fa = fa{n) ° /(T(n-l) o • ' ' o /^(i), 

which, in general, will be different from /. Thus, fa is obtained through 
sequential update of the coordinate functions. Sequential update of vari- 
ables plays an important role in computer science, e.g., in the context of 
distributed computation. See [§] for details. 

Many processes that can be represented as dynamical systems are in- 
trinsically stochastic, and polynomial dynamical systems can be adapted to 
account for this stochasticity. In the context of biochemical network mod- 
els, sequential update order arises naturally through the stochastic nature 
of biochemical processes within a cell that affects the order in which pro- 
cesses finish. This feature can be incorporated into polynomial dynamical 
system models through the use of random sequential update. That is, at 
each update step a sequential update order is chosen at random. It was 
shown in [l[ in the context of Boolean networks that such models reflect 
the biology more accurately than parallel update models. In [13] a stochas- 
tic framework for gene regulatory networks was proposed which introduces 
stochasticity into Boolean networks by choosing at each update step a ran- 
dom coordinate function for each variable, chosen from a probability space 
of update functions. Stochastic versions of polynomial dynamical systems 
have yet to be studied in detail and many interesting problems arise that 
combine probability theory, combinatorics, and dynamical systems theory, 
providing a rich source of cross-fertilization between these fields. 

5. Discussion 

This paper focuses on polynomial models in two fields, design of exper- 
iments and inference of biochemical networks. We have shown that the 
problem of inferring a biochemical network from a collection of experimen- 
tal observations is a problem in the design of experiments. In particular, the 
question of an optimal experimental design for the identification of a good 
model is of considerable importance in the life sciences. When focusing on 
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gene regulatory networks, it has been mentioned that conducting experi- 
ments is still very costly, so that the size of a fractional design is typically 
quite small compared to the number of factors to be considered. Another 
constraint on experimental design is the fact that there are many limits to 
an experimental design imposed by the biology, in particular the limited 
ways in which a biological network can be perturbed in meaningful ways. 
Much research remains to be done in this direction. 

An important technical issue we discussed is the dependence of model 
choices on the term order used. In particular, the term order choice affects 
the wiring diagram of the model which represents all the causal interaction 
among the model variables. Since there is generally no natural way to choose 
a term order this dependence cannot be avoided. We have discussed available 
modifications that do not depend on the term order, at the expense of only 
producing a wiring diagram rather a dynamic model. This issue remains a 
focus of ongoing research. 

As one example, an important way to collect network observations is as 
a time course of measurements, typically at unevenly spaced time intervals. 
The network is perturbed in some way, reacts to the perturbation, and then 
settles down into a steady state. The time scale involved could be on the 
scale of minutes or days. Computational experiments suggest that, from the 
point of view of network inference, it is more useful to collect several shorter 
time courses for different perturbations than to collect one highly resolved 
time course. A theoretical justification for these observations would aid in 
the design of time courses that optimize information content of the data 
versus the number of data points. 
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Appendix A. Concepts from computational algebra 

In this section, we let k denote a field and R the polynomial ring A^[x]^ , . . . , Xyi\ . 
A subset / C i? is an ideal if it is closed under addition and under multipli- 
cation by elements of R. 

Definition A.l. Let F be a finite set of points in fc". The set 

I{V) = {f eR: f{a) = for ah a G 

of polynomials that vanish on V is called the ideal of points ofV. 

Note that I(l^) is indeed an ideal. In fact, ^{V) is zero- dimensional since 
the fc-vector space R/\{V) is finite dimensional with dimk[R/\{V)) = \V\. 
While the number of generators of the vector space is fixed, the generators 
themselves depend on the choice of term order. 

Definition A. 2. A term order on i? is a relation >- on the set of monomials 
x° := x°^x'^ ■ ■ ■ x^" such that ;^ is a total ordering, 

x*^ ^ x^ =^ x"x^ y x^x^ 

for any monomial x"^, and ;^ is a well-ordering; i.e., every nonempty subset 
of monomials has a smallest element under y. 
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Given a term order every nonzero polynomial f R has a canonical 
representation as a formal sum of monomials 



with fi ^ R and ai £ k for i = 1, . . . ,r, and Uifi >- ajfj for all i > j. 
Moreover, ai/i is called the leading term of /. 

Definition A. 3. Let ;^ be a term order and I C R an ideal. A finite subset 
G = {gi, . . . ,gm} C / is a Grobner basis for / if the leading term of any 
/ € / is divisible by the leading term of some g^ under >- . The normal form 
of f G R with respect to G, denoted NF{f, G), is the remainder of / after 
division by the elements of G. 

Theorem A. 4. Every nonzero ideal I C R has a Grobner basis. 

Theorem A. 5. Let G be a Grobner basis for I C R and let f (z R. Then 
NF{f, G) is unique. 

Let G be a Grobner basis and LT[G) be the set of leading terms of the 
elements of G. The set {x'' : x'' ^ LT{G)} is a basis for R/l{V) and its 
elements are called standard monomials. Given a Grobner basis G of I(V^) 
with respect to a term order, every nonzero polynomial / € R/\(y) has a 
unique representation as a formal sum of the standard monomials. 

Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State 
University 

E-mail address, Reinhard Laubenbacher: reinhard@vbi.vt.edu 



r 




i=l 



Mathematical Biosciences Institute, The Ohio State University 
E-mail address, Brandilyn Stigler: bstigler@mbi.osu.edu 



